Nonintrusive Failure Detection and Recovery for Internet Services Using Backdoors

نویسندگان

  • Florin Sultan
  • Aniruddha Bohra
  • Yufei Pan
  • Stephen Smaldone
  • Iulian Neamtiu
  • Pascal Gallard
  • Liviu Iftode
چکیده

We describe an architecture for nonintrusive failure detection and recovery in a cluster of Internet servers in which nodes mutually monitor their liveness and recover client sessions from failed nodes. The system is based on Backdoors, a novel architectural approach for remote healing of computer systems. Backdoors enables monitoring and recovery/repair of state in a computer system by remote access to system resources (memory, I/O devices) without using its processors. Backdoors allows remote actions to be performed with no overhead, and even when the processors (but not the memory) of a machine are not available. We have implemented a Backdoors prototype by modifying the FreeBSD kernel and using Myrinet NICs for remote access. The system uses remote DMA operations to perform monitoring, detect failures and extract OS and application state from a failed machine. We have used our system to run several open-source Internet servers and to run a complex multi-tier e-commerce application. The system tolerates multiple node failures while providing correct and continuous service to ongoing sessions, with negligible disruption.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Strategy Formulation for Service Failure Recovery, Using Mixed Research Method

The purpose of this study is to explain the strategies affecting the failure recovery in significant services which researches had previously disregarded. Since more than half of the total global wealth comes from the service sector, this study gains importance. Service failures and failed recoveries are among the leading causes of customer switching behavior from service organizations. The exi...

متن کامل

Recovering Internet Service Sessions from Operating System Failures Motivation and Approach

Critical Internet services such as ecommerce, online auctions, and banking run on complex, multi-tier architectures built with commodity (offthe-shelf) machines and operating systems. These stateful services are sensitive to server failures: active client sessions on these servers are lost, although the state associated with them might still be intact in a failed machine’s memory. We developed ...

متن کامل

Emulation-based Evaluation of an Architecture for Wide-Area Service Composition

Service composition provides a flexible way to quickly enable new application functionalities using component services. We focus on the scenario where next generation portal providers “compose” the services of other providers. We have developed an architecture based on an overlay network of service clusters to provide failure-resilient composition of services across the wide-area Internet: our ...

متن کامل

An architecture for highly available wide-area service composition

Service composition provides a flexible way to quickly enable new application functionalities in next generation networks. We focus on the scenario where next generation portal providers “compose” the component services of other providers. We have developed an architecture based on an overlay network of service clusters to provide failure-resilient composition of services across the wide-area I...

متن کامل

Nonintrusive Remote Healing Using Backdoors

In this paper, we propose a remote healing approach for computer systems based on backdoors, a system architecture that supports monitoring and repair actions on a remote operating system or application memory image without using the processors of the target machine. A backdoor can be implemented using the remote memory communication technology provided by communication standards like Virtual I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004